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Abstract 

We generalize the poissonian evolving random graph model of Q to 
deal with arbitrary degree distributions. The motivation comes from 
biological networks, which are well-known to exhibit non poissonian 
degree distribution. A node is added at each time step and is con- 
nected to the rest of the graph by oriented edges emerging from older 
nodes. This leads to a statistical asymmetry between incoming and 
outgoing edges. The law for the number of new edges at each time step 
is fixed but arbitrary. Thermodynamical behavior is expected when 
this law has a large time limit. Although (by construction) the incom- 
ing degree distributions depend on this law, this is not the case for 
most qualitative features concerning the size distribution of connected 
components, as long as the law has a finite variance. As the variance 
grows above 1/4, the average being < 1/2, a giant component emerges, 
which connects a finite fraction of the vertices. Below this threshold, 
the distribution of component sizes decreases algebraically with a con- 
tinuously varying exponent. The transition is of infinite order, in sharp 
contrast with the case of static graphs. The local-in-time profiles for 
the components of finite size allow to give a refined description of the 
system. 

1 Introduction 

Evolving graphs arise naturally in the modelization of communication net- 
works, but also of social organizations and biological phenomena : brain 
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formation, genetic regulations, etc. Until recently, quantitative data were 
scarce, but the situation is changing very quickly |[l|, ||, |l^. In many impor- 



tant cases the laws governing the evolution of the network are unknown but 
non deterministic, and the final number of nodes is rather large. This ex- 
plains why physicists (see the reviews ||l|, |] and references therein) have been 
developping recently random graph techniques in the thermodynamic limit 
to understand some peculiar features, the most salient being large degree 
distributions, that seem to occur in an unexpected variety of situations. 

Many cases of interest lead to oriented graph models, for which the in- 
and out-degree distributions can be governed by different laws. This was one 
of the basic observations in the study of the yeast genetic regulatory network 
presented in IQ: a single gene may participate to the regulation of many 
other genes - the law for out-degrees seems to be large -, but each gene is 
only regulated by a few other genes - the law for in-degrees seems to have 
finite moments. A biological interpretation for the asymmetry is that the 
few promoter-repressor sites for each gene bind only to specific proteins, but 
that along the genome many promoter-repressor sites are homologous. One 
of our aims is to work with a model that incorporates such an asymmetry. 

We shall follow quite closely the philosophy of p|, and in particular pay 
attention not only to global quantities, but also to local-in-time profiles. We 
feel that for evolving networks this is a crucial condition to extract relevant 
information, because as we shall see, global quantities (averaged over time) 
give a distorded view of the network. On the other hand, it is hard exper- 
imentally to access local quantities, either because the ages are not known, 
or because their consideration would reduce the statistics down to an un- 
acceptable level. But the amount of available information is growing very 
rapidly, and one can hope that local quantities will become accessible in a 
near future. 

The model we study is the natural evolving cousin of the static maximal 
entropy model with given in-degree distribution j^, |13|. Starting from a 



single vertex at time 1, a new vertex is created at each time step - so that 
at time t, the size of the system, i.e. the number of vertices, is t - and new 
oriented edges are created with specified probabilistic rules. An arbitrary 
probability generating function T encodes the parameters of the model in 
the thermodynamic (large t) limit. Precise definitions are given in the next 
section. 

Our main results are the following : 

The global and local in-degree distributions are given by T, see eq.(^, 
whereas the global out-degree distribution is geometric with average T'(l), 
see eq.(|5D. The local out-degree distribution is poissonian but age dependant. 
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eq.(^. In the case of a static maximal entropy random graph with the 
same in-degree distribution ([H, |l^), the distribution of out-degrees would 
be poissonian. 

The global structure of connected components is studied via a generating 
function which satisfies a differential equation, leading to recursion relations 
for the distribution of component sizes, see eq.(0). This is illustrated by 
analytical computations in the case when T{z) is a polynomial of degree 1. 

The general criterion for the absence of a component containing a finite 
fraction of the sites in the thermodynamic limit is that the average a = T'(l) 
be < 1/2 and the variance 7 = T'{1) + T"(l) - T'{lf be < 1/4. In that 
case the system contains components whose sizes scale like a power of the 
total size of the graph, see eq.(|19|); equivalently, the probability distribution 
for component sizes has an algebraic queue, see eq.([T8D. Above the threshold 
(when T'(l) > 1/2 or 7 > 1/4) this probability distribution is defective but 
decreases exponentially see eq. (pT|) . The boundary separating the percolating 
and non percolating phases is 7 = 1/4 and a < 1/2. In the percolating phase, 
but close to this boundary, 7 — 1/4 is >0 but small and the giant component 
is exponentially small, see eq.(pO|). This situation, somehow reminiscent of 
the Kosterlitz-Thouless transition, had already been observed in a variety of 
models [|, I, i. 

We compare the percolation criterion with the one that emerges from the 
study of a static maximal entropy random graph with the same in- and out- 
degree distributions as our evolving graph (|^ 0). In the static case, the 
growth of the giant component is generically linear close to the threshold. But 
we show that if T{z) leads to a percolating static graph, it is automatically 
percolating for the evolving graph model. The intuitive explanation lies in the 
inhomogeneities of the evolving graph : the environment of an old vertex is 
denser than what a static model produces. And indeed, the giant component 
profile close to threshold is very asymmetric, see eqs. 



In the thermodynamic limit, the finite components are trees, and we 
derive a direct enumeration formula to count for their abundance eq.(|l6D. 
This can be used to describe all local in time profiles of finite components, 
a result we also recover for one time quantities via a generating function 
aproach eq.([l3|). In the appendix, we give a proof of the equivalence of the 



generating function approach and tree enumeration, a question that was left 
opened in [||. 

We have confronted our analytical results with numerical simulations 
whenever possible, our preferred example being when T is a geometric dis- 
tribution. 

Aknowledgement : we thank Denis Bernard for a careful reading of the 
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manuscript and for his kind permission to include the results of section. |5.2| , 
that where obtained with him some time ago. 



2 The model 

The random graphs we consider are constructed according to the following 
rules : 

1. For t = 1, 2, ■ ■ we denote by Qt the set of simple graphs with vertex 
set Vt = i.e. the set of pairs (Vt^Et) where Et, the set of 
edges of the graph, is a subset of {{i,j),l<i<j<t}. We orient 
the graph (yt,Et) by saying that G Et is an edge from i to j : 
an edge always goes from an older to a younger vertex. Note that 
Qi = {({1},0)} contains a single graph, made of one vertex but no 
edge. 

2. An evolving graph is a sequence G = {Gt)^-^^ = {{Vt, Et))^^^, where 
Gt & Gt ior t > 1, and for t > 2, Gt-i is the induced subgraph of 
Gt obtained by removing vertex t and all edges adjacent to it. Stated 
differently, Gt for t > 2 is obtained from Gt-i by adding vertex t and 
some edges of the form {i,t), i < t. This model of evolution implies 
that knowing Gt is equivalent to knowing Gi, ■ ■ ■ , Gt-i, Gt. We write 
Q for the set of evolving graphs. 

3. In addition to these rules of construction, we put a probability measure 
p on the set of evolving graphs. If G is an evolving graph, we denote 
by Gt the number of edges arriving at vertex t in Gt (which is the same 
as the number of edges arriving at t in Gt' for any t' > t). We want 
to fix the probability distributions for the number of edges arriving at 
vertices 1,2, ■ ■ -, i.e. we impose p{{G E Q,Gt = k}) = Tt^k for a given 
sequence of probability distributions {Tt^k,t > 1,0 < k < t— l},rj ^ > 0, 
Ylk'^t,k = 1- Since this criterion is far from fixing unambiguously the 
probability law, we shall only be interested in the one which doesn't 
introduce any other bias, i.e. the one which maximizes entropy. Then 
a simple computation shows that for Ht in Qt, 

p{{Geg,Gt = Ht})= n 

l<s<t \Hs) 

Note that we did not worry about the size of the sequence defining an 
evolving graph. In fact, we shall as often as possible suppose it is infinite. 
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3 Notations and basic results 



3.1 Generating functions 

For any vertex v > 1, let T„ be the generating function for the incoming 
degree distribution : 

v-l 

Tv{z) = r^,kz'' 

We shall be mostly interested in the thermodynamic limit, i.e. in t indepen- 
dant features of the large t behavior of Gt when G is a random element in 
{Q,p). A typical example is the behavior of lim^^oo 1-^*1/1^1 [|- Obviously, 
this limit does not exist for any a priori choice of the T^ys. The study 
of thermodynamic convergence for this quantity and others would be of in- 
dependent interest, but we shall see that a simple assumption ensures that 
many quantities of interest have a thermodynamic limit : we impose that the 
sequence of functions Ty{z) converges to a probability generating function 

nz)^j:'^z' (1) 

fc>0 

in such a way that the sequence of averages and variances converges to that 
of T(z). 

This property implies in particular that, for any k, x„ ^ tends to ^ for 
large v. Moreover, we will often use the fact that T(l) = 1. We use the 
notations a = T'(l) and 7 = T"(l) + T'(l) — T'(l)^ for the average and 
variance of T. 

As a technical hypothesis, to avoid several pathologies, we shall always 
implicitely assume that Tq = T(0) > 0. 

3.2 A preliminary formula 

We shall sometimes be lead to consider situations in which a new vertex t + 1 
appears and connects to the rest of the graph avoiding m forbidden vertices. 
This will happen, for instance, if one is interested in the distribution of 
connected components in the graph. In this section we would like to give a 
general formula for such situations. Hence, let at+i^m be the probability for 
a new vertex t+1 not to connect to m given vertices. This probability is 

^Here and in the sequel, IS*] denote the number of elements of the finite set S. 
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given by the formula : 

ft-m\ 
_ V n ) 

Oit+l,m = / ,n+l,n n\ 

n>0 \n) 

n-m\ 

With at+i,m{z) = J2n>o^t+hnZ'^^^jry-, we notice that = at+i,m{z)- 

jz^o>t+i,mi^)- Since at+ifiiz) = Tt+i{z), we have : 



(m\ 2 ("')z'^ 



t{t 



In the thermodynamic limit, we shall only need the large t finite m ap- 
proximation 

am 

at+i,m - 1 —■ (2) 

These simple formulae may now be used to calculate a few quantities. 
We'll first describe the degree distributions of incoming and outgoing edges, 
whereas the two following sections will present the main relations governing 
the distributions of connected components of the graphs. 



4 Degree distribution 

4.1 Notations 

Let 

1. lj{t) (resp. lj'{t)) be the number of incoming (resp. outgoing) edges 
at a vertex j at time t 

2. Vfc(t) (resp. v^(t)) be the number of vertices with k incoming (resp. 
outgoing) edges at time t. 

With these notations, the edge distributions are described by the generating 
functions : 

0<k<t l<j<t 
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In the thermodynamic limit, it is possible to give an expression for these 
generating functions. 

4.2 In-degree distribution 

Up to a normalisation factor, the probability for vertex j to have k incoming 
edges is Tj^^.This shows that 

k>0 

In the thermodynamic limit, this becomes 

(^'7W)=T(z) (3) 

Hence, in the thermodynamic limit, the probability for any vertex to have k 
incoming edges is given by ^, which is correctly normalized. 
This implies in particular that, in the thermodynamic limit, the average 
number of edges coming to a vertex is precisely a. 

4.3 Out-degree distribution 

Any edge emerging from a vertex j can be seen as an edge arriving at a vertex 
younger than j. Let t > j he the time at which one observes the number of 
edges emerging from j. We define a = j/t. The quantity is given 

by the formula : 

E'-' E n ( E r,..^'&l] n ( E 

k at<ji<-<jk<tl<i<k \0<k'<ji-2 \k'+V / j<j'<t;j'^ji \o<k'<j'-2 \ k' ) / 

Indeed, the probability for j to have k outgoing edges is obtained by summing 
over ji-i - • ■ iJk > 3 the probabilities for j to be linked to these k vertices and 
to no other vertex. Each of this probabilities can easily be calculated from 
the following relations : 

j'-i 



0<fc'<ii-2 Vfc'+l/ 



k' 



0<k'<j'-2 



r-2 
k' 
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For fixed a and large t, the second product simplifies to a" and the average 
we look for becomes : 



(T"^2'=^[lnt-lnat]''a^ 



Hence, for fixed a, 



for large t, so the local out-degree distribution is Poissonian with cr-dependent 
parameter a In a. Integrating a between and 1 yields the asymptotic value 
for the generating function i^^{z) : 

at large t. 

Identifying the term of degree in 2; in the development of this function 
finally yields the probability for a vertex to have k outgoing edges : 

(1 + a) 

We see that the distribution of outgoing edges is geometric and depends on 
the probability distribution T only through the average number of incoming 
edges a. 

4.4 Mixed distribution 

In our model, the number of in and out edges ifit) and Ijit) at a given 
vertex are independent by construction. The generating function for the 
mixed degree distribution is 

k+,k- l<j<t ^ 



which is easily obtained from eqs.(^H) 



The local counterpart would easily follow from eqs.(|^,|) 
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5 Connected components 



In this section we give formulae for the number of components of size k 
in the thermodynamic limit (paragraph |5.1|) , and for the time distribution 
of these components (paragraph |5]^). This derivation makes some natural 



assumptions of self averaging. In paragraph |5.4 we evaluate without such 
assumptions the contribution of individual graphs to the weights of random 
graphs, and show that trees dominate the thermodynamic limit. We use this 
result in the appendix to compute directly the generating function for the 
number of components of size k, and show that it coincides with the one 
given in paragraph ( p. 11) . 



5.1 Global-time results 

Let Nk{t) be the number of connected components of size k at time t for a 
given graph, and Nt{z) the corresponding generating function : 

k>l 

To evaluate this function, we shall investigate its evolution between times t 
and t + 1. When, at time t + 1, the new vertex is added and randomly con- 
nected to some of the older vertices, the components to which these vertices 
belonged are destructed and collapsed into a new, bigger, component. More 
precisely, let us put nfc(t) for the number of components of size k connected 
to vertex t + 1 when it is added to the graph. For all k, Ukit) components 
of size k are thus destroyed between times t and t + 1, while a component of 
size 1 + Ylik ^''^kit) is created. To summarize this : 

Nt+i{z) - N,{z) = -J2Mt)z' + zi+S^'^^^W (7) 

k 

Now, the point is to average this equality over all graphs of size k and all 
possible connections of vertex t + 1 with these graphs. 

First of all, we shall propose a compact expression for valid for 

all t. We will deduce from it an expression of {rikit)) before calculating the 
average 

Qf ^i+Efcfcnfe(t)_ Finally we will give an evolution equation valid in 
the thermodynamic limit. 



Exact derivation of 



•^'^^ This average is given by 
^^^^z^wt+iikj), 

Gt j>0 
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where Wt+i{k,j) is the total weight of graphs of size t + 1 obtained from 
a graph Gf of size t in such a way that, when it was added, vertex t + 1 
connected to j components of size k. Hence, Wt+i{k,j) is the term of degree 
in a; of the Laurent serie : 

-.§if--(T)(s-'e))'5-'(-':"" 

In this expression, 

• n is the number of edges connecting vertex t + 1 to Gt ; 

• i is the number of edges connecting vertex t + 1 to a given component 
of size k ; 

• p is the number of edges connecting vertex t + 1 to vertices of Gt which 
don't belong to a component of size k. 

The sums over i, j and p simplify, so that (^z"-''^^^) is the term of degree in 

Gt,n>0 \n) 

If we rewrite the sum over the G^'s as an average over graphs of size t and 
expand the two parenthesis, we get 

Tf+l,n 



\n,m>0 \n) ^ 



ml \ n 



In this formula, we recognize cct+i^^ = Yl:n>o ^m" (* i which is the 

probability that vertex t-\-l doesn't connect to m given vertices. The average 
we need becomes : 

m>0 \ \ / / 



This formula gives in particular the average number of components of size 
k connected to the new vertex : 

{n^it)) = {N^{t)) (1 - at+i,k) (9) 
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Exact derivation of <^2;Sfe"ft(t)^ ^ji^e only difference witli the computation 
we've just described is tliat tlie sum over j is replaced by sums over all 
possible jfc's, while becomes z^Zt^ifc Omitting the details, we just give the 
result, which has the same form as formula (||) : 



{m} 



In this equation, the sum is performed over all sequences mi,m2, ■ ■ •. 

This expression can be simplified if we note that at+i,m is a weighted sum of 

Newton coefficients, so that 



n>0 ' 



where the symbol § denotes the contour integral J along a small contour 
surrounding the origin. 

Evaluating the sum over the m^'s now yields 



Evolution equation for connected components Formula (0) can now 
be written in a simpler form using equalities (|) and (|ToD. With T((l) = 1, 
this leads to : 

{N,^,{z)) - {N,{z)) = J2{N,{t)){a,^,^,-l)z' (11) 

k 

n \nJ 

This equation is exact for finite t. To get the equation governing the large 
t behavior, we make a thermodynamic assumption : because Tt{z) converges 
to T{z) in such a way that the averages and variances converge as well, we 
expect that {N^it + 1)) — {Nk{t)) converges to some deterministic quantity 
Ck, so that {Nk{t + 1)) — {Nk{t)) — Ck may fluctuate but is o(l) at large t. 
Then, by Cesaro convergence, the same is true of {Nk(t)) /t — Ck- 

Then the left-hand side of eq.(0) is ~ ^k^kz'^- Using at+i^k ~ 1 - ^ 
for large t, the first sum on the right-hand side is ~ —a^^j^kCkZ^ . In the 
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second sum, we replace x by x/t, leading to 



) 



For large t, ^i^^ ~ r„ and [z^ (1 + + 1 - z'^Y"^''^ ~ (l + xfc^V^)^'^*^ 

^xkz^Nk{t)/t ^ The averages are now trivial to take. The second sum is seen to 
be 

n+l 



X 



n 

Putting this together, we get 

u u ^ ^ 



k k 



X 



Defining C{z) = CkZ and evaluating the contour integral, we obtain the 
compact formula 

C{z) = -zad,C{z) + zT {zd,C{z)) (12) 

which describes the number of components of size k at large t. 

This equation has a simple interpretation in terms of destruction and 
creation of components. Indeed, C{z) = Nt+i{z) — Nt{z) gives for each k the 
variation of A^^ between t and t + 1. This variation is the sum of two terms : 

• —zadzC{z) gives the product of the average number, a, of incoming 
edges in a given vertex by the mean proportion of vertices in compo- 
nents of size k ikCk)- Stating that in the thermodynamic limit, two 
edges arriving at a vertex have vanishing probability of coming from 
the same finite component, this term can be seen as the average number 
of destroyed components. 

• zT {zdzC{z)) involves a sum of terms of the form "^"^^^'^^i'" kiCk^ ■ ■ ■ knCk 
It counts the average size of the component created when a new vertex 

is added. 



Equation ([T2[ ) allows a recursive calculation of the C^'s. Unfortunately, 
we were not able to find an explicit formula valid for all k and we only give 
here the values of the first few of them : 

^0 



c. 



1 + a 

Ton 



(l + a)(l + 2a) 
4rorf (g + 1) + T^T2{2a + 1) 
2(a + l)2(2a + l)(3a + l) 
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5.2 A simple example 

The case when T{z) = {1 — p) + pz leads to tractable equations even at 
finite t. As we shall see later, this example is pathological from the point of 
view of the percolation transition. This is another reason to give a separate 
treatment. 

Though it is likely that this toy model has been solved more than once 
before, we have found no reference. So we give a sketch of the solution with 
apologies to the original contributions. In particular, we compute the scaling 
function governing the statistics of large components. 

When vertex t + 1 is added, it remains isolated with probability 1 — p in 
which case iVfc(t + 1) — N^it) = 6k^i, or is attached to a component of size / 
to build a component of size / + 1 with probability plNi{t)/t, in which case 
Nk{t + 1)- Nk{t) = Sk,i+i - 5k,i. We infer 

{Nt+,{z)) - {Nt{z)) = {l-p)z + Y,pl{Ni{t))/t{5k,i+i-5k,i)z^ 

k,l 

= {l-p)z + pY,HNi{t))/t{z^^'-z') 
I 

= {l-p)z + p/t{z^-z)d,{Nt{z)). 



The initial condition is {Ni{z)) = Ni{z) = z We can simplify this equation 
by the change of variable z = w/{l + w). Setting Qt{w) = {Nt{z)), the 
equation for Qt is 

yj nj 

Qt+1 - Qt = (1 - p)— p-dyjQf 

l + w t 

Nt{z) is a polynomial in z, so Qt+i has a regular series expansion in w, 
Qt = J2k>iQkit)w'' which leads to Qkit + 1) = ^gfc(t) + (1 - 
Direct substitution shows that qk{t) = tY^^{~^)'^^^ is a particular solution. 
The general solution rk{t) of the associated homogeneous equation rfc(t + l) — 
^^Y^rk{t) is rk{t) = r(tjr(i-lk) ^fe (-*-)■ Taking into account the initial condition 
Qi — w/{l + w) leads to 

. (f) - (fl^ + + Tjt-pk) \ 
Qk[t)-[ ij yi+pk^ 1+pk r{t)T{i-pk)J- 

We can now go back to the z variable : 



(Nkit)) - E(-i)'^' (4 

;— 1 V 



p ^p{l + l) T{t-pl) \ T{k) 



1+pl 1+pl T{t)T{l-pl) J T{l){k-l) 
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The contribution corresponding to the first term inside the parenthesis, the 
one which is hnear in t, can be resummed. Indeed, if C{z) is a solution of 
C = (1 — p)z + p{z^ — z)dzC, tC{z) solves the original equation for {Nt{z)) 
even at finite t. The solution which is regular at 2; = is 

c(z) = V i-pr(fc)r(i + i/p) ^,, 

t^i p nk+i+i/p) 

One can check that, for any p G [0, 1[, C'{z = 1) = 1. As we shall explain 
later, that means there is no percolation. 
To summarize, 

(N (f)\ = i-pr(fc)r(i + i/p) ^ + T{t-pi) T{k) 

^ '^^^ p r{k + i + i/p)^^^ ^ i+pi r{t)T{i-pi)T{i){k-i)\ 

It is now possible to get the exact scaling function governing the size 
distribution of large components. When both t and k are large, the ther- 
modynamic contribution scales like ^^r(l + 1 / p)tk~^~^^P while in the other 

contribution, for fixed /, one finds {—lY^^ ^i^pi r(i-pOr(0 ^~^'^'~^" '^^^^ 
creases very fast with I so for large k one can extend the range of Z to 00. 
The balance between the thermodynamic and finite size contributions shows 
that the scaling variable is s = kt~P. In the scaling limit 



{kN,(t)) ~ i^r(i + i/p).-v. + E(-i)'"' 1'+'"plV(i-'-- ^ 



1>1 

Let S{s) denote the scaling function on the right hand side. 

1 

r(i-pOr(0 



pinif' 



Defining sA{s) = J2i>i{~^y^^ rd-^Drm ^'^ ^ rewrite 



S(s) Is = ^-^r(l + 1 /p)s-^-^'P + ^(s) - /' d\A(\s)\'''^ . 

P P Jo 

This expression of 5" exhibits clearly its small s behavior. To get con- 
trol on the large s behavior, one can use the familiar representation = 

^ dwe'^w~^ ioY z = 1 — pi and sum over / to obtain 

Ais) = — [ dwe'"-''"''wP-\ 
^ ' 2i7r Jc 

Observing that r(l -|- l/p)s~^~^/P — d\A{\s)\^l'P ^ one gets a compact 
expression 

S{s) /s = A{s) + ^—^ r d\A{\s)\^/P. 
P Jl 
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Figure 1: Plots of S (s) / Sthermo{s) for p — 0.2 (solid line), p = 0.5 (dots) and 
p = 0.6 (dashed line) 

The large s expansion can then be obtained by standard methods (saddle 
point for A(s) and then analysis at A = 1 for the second term). The formulae 
are rather cumbersome and we simply quote the leading exponential behavior 

log5'(s)~ ^pT^igT^i s ^ +00, 

p 

showing that S{s) decreases very fast at large s. As a simple example, take 
p = 1/2. Then A{s) = ^e'^'/^ and 

Returning to arbitrary p, the relation {kNk{t)) ~ Sik/fP) implies that 
the average number of components of size k > sf/ is {J2k>stp -^kit)) ~ 
duS{u)/u < oo. So the large components have a size of order t^. 

5.3 Local-in-time results 

Knowing the global-in-time distribution of connected components, it is natu- 
ral to wonder whether this distribution is homogeneous in time or not. More 
precisely, if j — at with fixed a and i — > oo, what can we say about the 
probability for vertex j to be in a component of size k ? 
For /c > 1 let Pfc(c) be the probability that, in the thermodynamic limit, 
vertex at belongs to a connected component of size k. The purpose of this 
section is to give an equation governing the p^s. 
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Local-in-time equation The systematic to derive this equation relies on 
the same ideas as those used in paragraph to get the global formula, and 
involves heavy formulae. Consequently we use directly the thermodynamic 
assumption and give a more intuitive argument. 

Let At be an interval of time such that At 3> 1 but ^ ^ 1. Between times 
t and t + At, the number of vertices added to the graph is large but much 
smaller than the size of the graph. 

Following the interpretation of eq. ([12D, the number of new components of 
size k is ^ 



Each of the m old components has average time distribution so that, on 
average, the contribution of the At new components to the time distribution 
of components on k vertices is 

Let us put p{<J,z) = J2k Pk{'^)^^ ■ Multiplying the expression above by 
and summing over k yields the term Atz^dzp{<J, z)T\zdzC)da. 

Following again the interpretation of eq. ([12D , aAtkCk components of size 
k are destructed between t and t + At. According to the definition of Pk{o'), 
the number of vertices of relative age contained between a and a + da, which 
belonged to one of these components is aAtkpk{cr). 

After summation over k, the destruction term is aAtzdzp{cr, z)da. 
Moreover, a vertex of relative age a at time t has relative age -j^^ at time 
t + At. Hence, the local-in-time profile verifies, to first order in At/t, the 
following relation 

tda -f^At ' ^ ~ P^^^^)^ ~ —CiAtzdzp{(T, z)da+Atz^dzp{(T^ z)T' {zdzC)da. 

This leads to the differential equation 

ad^p ={a- zT' {zdzC)) zdzP (13) 

As in the case of a Poisson law, this equation leads to recursion relations 
for the pfc(cr)'s. Differentiating eq. ( [T^) k times with respect to z and tak- 
ing z = leads to a first order linear differential equation for p^, in which 
pi, ■ ■ ■ , pk-i appear. Putting x = a", the first few distributions are : 

Pi = TqX 
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Figure 2: The analytic result (solid lines) for the profiles of small connected 
components (from top to bottom k=l,2,3) compared to numerical simulations 
(gray clouds) on 5000 random graphs of size 30000. T{z) = (1 — p)/(l — pz) 
with p = 0.2. 
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Component distribution at a = 1 To conclude this paragraph, we shall 
compute the probability for the oldest vertex to belong to a component of size 
k. This is not difficult to do because, unlike all the other vertices in the graph, 
the youngest one does not have any outgoing edge. Hence, following once 
again the interpretation of eq. (|12|), it belongs to a component of size k with 
probability ^ Efci,...,fc„ hCk, ■ ■ ■ k^Ck^ More compactly : 

p{a = l,z) = zT{zd,C). (14) 

Note that eqs. ([T3| , p!^ can be used to recover the global equation (|12|). 
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5.4 Tree distributions 



Given a connected graph G, one may wonder how many connected compo- 
nents of the random evolving graph are isomorphic to G. In other words, if 
k is the number of vertices of G, we look for the average number of increas- 
ing maps V : [1, ■ ■ ■ , /c] [1, - ■ ■ ,t] such that the vertices vi, - ■ ■ ,Vk span a 
connected component of the random graph isomorphic to G. 
Let rrii be the number of edges incoming to vertex i in G. The probability 
that vertices vi,---,Vk span a connected component of the random graph 
isomorphic to G is evaluated using the following two rules : 

• Vertex Vi has rrii incoming edges coming from the given vertices ; 

• The vertices w of the random graph which are not in the image of v 
must not be connected to any of the fj's. 

Hence, putting v^+i = t + 1, the probability we look for is 

1 



n 



(Vi-l\ 



n E 



J-l / ^ 

\ rUi ) Vi<w^<Vi+i j>0 V i / 



Wi - I 
J 



The average number of components is then obtained by summing this 
expression over all increasing maps v 



i-o) = E n 

l<vi<---<Vk<t i=l 



\ rUi J Vi<Wi<v^+i j>0 \ j ' 



Wi — i — 1 
3 



In the thermodynamic limit, the sum over j is given by formula (|2D, and 

-ia In^iiil 

its product over Wi tends to e provided only large Wj's contribute 

significantly. Hence, approximating the sum over Vi,---,Vk by an integral 
yields a contribution of G equal to 



mi , 



,m.k 



t 



k—m 




a— mi 



(15) 



0<o-i<---<o-fc<l 



This formula looks pretty much like the one proposed in the poissonnian case 
in 0. In particular, it shows exactly in the same way that only connected 
graphs with k = m + 1 (i.e. trees by Euler's formula) give a contribution 
that scales like t in the thermodynamic limit. Moreover, the contribution of 
a tree with degree distribution rrii is 



n 



i{a + 1) - (mi H + rrii) 



(16) 
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This formula was obtained by integration over all relative ages ctq, ■ ■ ■ , Ck-i. 
However, if we only integrate over some of these variables while fixing the 
others, say at^ < crk2 < ■ ■ ■ < Ckp, we expect to get the contribution of a 
given tree with k vertices amongst which p vertices have imposed age. For 
instance, taking k = 2 and integrating over ai while imposing a2 = <J gives 
the contribution of trees of size 2 with younger vertex of age a. Explicit 
integration yields a contribution rori(T^"/(a + 1) . On the other hand, if 
we integrate over a2 and fix ai = a, we get TQTi{a°'/a — a'^°'/a) for the 
contribution of trees of size 2 with older vertex of age a. The sum of these two 
quantities, which is expected to be the probability that a site of relative age 
a belongs to a tree with two vertices is, indeed, equal to p2(o") as calculated 
from equation (|13|). 

In fact, this result is not really surprising, since the derivation of eq.([T2[) relies 
on the fact that, in the thermodynamic limit, a new vertex connects with 
vanishing probability to several vertices in the same component. In other 
words, this equation of evolution takes only trees into account so eq.([l6[) 
should imply it. The full proof is instructive but tedious, and we relegate it 
to the appendix. 



6 The percolation transition 

Formula (0) gives a relation between the C^'s, which represent the asymp- 
totic number of connected components of size k. This equation involves in 
particular the function zdzC{z) = J2k kCkZ^. 

Note that kCk is the fraction of sites belonging to components of size k. 
This means that, if = 1, no single component in the graph can have 

size 0{t). On the other hand, if Ylik^^k < 1, the possibility exists that a 
giant component contains a finite fraction of the sites. The possibility that 
several very large components coexist is usually ruled out because under a 
rearrangment of a number o{t) of edges these components would merge with 
finite probability. Though we have not tried to build a formal argument in 
the case of evolving graphs, the intuition remains the same and is confirmed 
by numerical simulations. So we take for granted that if kCk < 1 a single 
giant component contains a fraction 1 — Ylik of the sites. 

6.1 Main results. 

The arguments that lead to the main qualitative and quantitative features 
of the percolation transition are of technical nature. So we postpone them 
to the next section. 
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Percolation criterion. Unless T{z) is a polynomial of degree 1, the system 
contains a giant component if either the variance 7 of T is > 1/4 or the 
average a of T is > 1/2. The boundary separating the non percolating 
phase from the percolating phase is given by the two conditions 7 = 1/4 and 
a < 1/2. The emergence of a giant componant is purely due to an increase 
of the variance of T above the threshold 1/4. 

Behaviour below threshold. If 7 < 1/4 and a < 1/2, the distribution of 
component sizes in the system is critical, caracterized by the following three 
equivalent properties. 

The dominant singularity of C{z) at z = 1 is 

C'^'^z) (X {1 - z)^^^ . (17) 

For large k, the fraction of sites belonging to components of size k de- 
creases like 

kCk oc fc"i-vi-47. (18) 

For a finite system of size t — > 00, the large components have a size of 
order 

k{t) oc t'-""^ . (19) 

Behaviour above but close to threshold. The percolation transition is 
of infinite order when 7 crosses the value 1/4 while keeping a < 1/2. If we 
denote by Pqo the fraction of sites occupied by the giant component, then 
when 7 — 1/4 —i> 0"*", Poo is exponentially small : 

log Poo ~ -7r/v/47-l for 7 ^1/4+. (20) 

For large k, the fraction of sites belonging to components of size k de- 
creases like 

kCk oc A;-ie-'=^°°. (21) 

6.2 Discussion. 

We start our discussion by analysing the behavior of C{z) close to 2; = 1. 
As before, we assume that T{z) is not a polynomial of degree 1, i.e. that 
T"(l) ^ 0. 

We start from equation (^) and apply the operator zdz to get 

{zd,C) + azd,{zd,C) = zT{zd,C) + zT'{zd,C)zd,{zd,C) (22) 
which involves only zdzC. 
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Set z = and Y{t) = 1 — zdzC . Equation (p2D can be rewritten for Y : 

= Y{t) - 1 + aY{T) + (t{1 - Y{t)) - T'(l - Y{t))Y{t)^ . 
We know make two assumptions. 

i) there is no giant component : kCk = 1 or equivalently Y{0) = 0, 

ii) the size distribution of clusters has a first moment : /ii = k^Ck is 
finite. 

Note that ii) implies that Y{0~) exists and has value — /ii. 

We differentiate the equation for Y with respect to r and put r = 0, 
yielding : 

/i?T"(l) +/ii(2a- 1) + 1 = 0. 

The discriminant of this equation for /ii is 1 — 47 where 7 = T"(l) + — 0;^ 
is the variance of the distribution T. 

The case when T"(l) = 0, i.e. when T{z) is affine, has some pathologies, but 
it has already been treated in detail. 

If T"(l) > 0, the quadratic equation has two roots, 

a) both real and positive if 1 — 2q; and 1 — 47 are positive, 

b) both real and negative if 1 — 2a is negative but 1 — 47 is positive, 

c) both complex if 1 — 47 is negative. 

Clearly, only case a) is compatible with our two assumptions i) and ii). In 
the sequel we shall take for granted that in this case, the assumptions i) and 
ii) are indeed true. 

In cases b) and c), at least one of the assumptions must fail. We show 
that it is i), the absence of giant component. To do that we need a more 
precise analysis. 

Let us first give some properties of F(r) for r < 0. By construction, Y{t) 
has a convergent expansion in powers of with negative coefficients (except 
the first) and is bounded by 0. So F is continuous decreasing on ] — cxd, 0]. 



To obtain the large order behavior of Ck we simplify eq. (|22D assuming that 
r and Y{t) are small|. This is certainly a good approximation to describe 
the small r behavior of Y{t) when there is no percolation cluster because in 
that case Y{t) is continuous and vanishes at r = 0. It is also true close to a 
continuous phase transition because Y{0) is small. 

Keeping only the dominant contributions yields 

Y (T"(l)r - ar) + (1 - a)F + r ~ 0. 

Setting 7F = T"{1)Y — ar, we derive the limit equation 

jFF + F + T = 0. (23) 
^Note that we do not assume that they are of the same order of magnitude. 
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This equation is the same as the one found and studied in P, |2|. As the 
presentation in ||2| is closer in spirit to this one, this is the one we refer to in 
the sequel for details. In the poissonian case, the authors showed carefully 
that eq. (|2^) indeed contains the quantitative universal features of the exact 
cluster generating function. We take for granted that this remains true for 
general T. 

When 7 < 1/4, we write j3 = ^^^^-47. The general integral of eq.(^) is 

Suppose F{0) = 0, but F{t)/t is not bounded close to r = 0~. Then at 
a point where F{t)/t is large, eq.(|2^ implies that F{t) is of order one, a 
contradiction. For analogous reasons, if F{0) = and F{t)/t is bounded, 
lim^^o- 1 + ^7 = and then 

2 i+i 

^ ^ 1 + 13 ^ ' 

To summarize, if 7 < 1/4 and F{fi) = 0, F'{0) exists (and then /ii = 
/c^Cfc is finite) so that i) implies ii): this means that if 7 < 1/4 but 

a > 1/2, i) has to be wrong, and kCk < 1. 

A word of caution is needed here. To get the limiting equation (p3D , 

we have neglected terms of order in (^2]). So strictly speaking, eq.(|25|) 



IS correct only if <2, i.e. 2/9<7<l/4. A more careful analysis, 
analogous to that sketched in ||2|, would show that in general, in the absence 
of a giant component, the small r expansion of F{t) starts with a standard 
taylor series in r up to order and then a leading singularity proportional 

1+13 

to ri-'3. 



When 7 < 1/4, we write (3 = 1/47 - 1. Eq.(|2l) implies that 



1 log(7F^ + tF + t')-^ arctan (j^^ ■ (26) 

is locally constant. As F is continuous, the above quantity jumps by ±7r 
when F + 2t changes sign. To fix conventions, we specify the function arctan 
by demanding that it is continuous and takes value in]— 7r/2,7r/2[. We argue 
by contradiction that F cannot vanish at r = : if F and r are small, the 
argument of the log is a small positive number so the first term gets large 
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Figure 3: Phase structure of the evolving graph in the subsimplex = ps = 
Pi = ■ ■ ■ , Po + Pi + P2 = 1- The unshaded area corresponds to the non 
percolating phase. Horizontal lines fill the region 7 > 1/4 and vertical lines 
the region a > 1/2. 



and negative, while the term involving arctan remains bounded. So again i) 
fails. 

We have established the percolation criterion announced in the previous 
section. 

Furthermore, the relation 7 — 1/4 = T"(l) — (2a — 1)^/4 shows that the 
region separating the percolating phase from the non percolating phase is 
7 = 1/4 and 2a — 1 < : the emergence of a giant component is purely due 
to an increase of the variance of T above the threshold 1/4. This is a bit 
counterintuitive, because it implies that there are cases with arbitrary small 
a and a giant component. 

To illustrate that point, we present the case when T is a quadratic poly- 
nomial. 

The properties that describe the system when there is no giant component 
or close to the threshold (when the giant component is small) only rely on 
equation eq.([2^), and we refer to jH for the detailed analysis that leads to 
eqsM 



6.3 Comparison with static graphs. 

Maximum entropy static graphs with fixed in- and/or out-degree distribu- 
tions have been studied in ^ |13|. If only the in-degree is fixed, the out-degree 
is always Poissonian, in contrast with the global geometric out-degree distri- 
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bution of the evolving cousin model. Let us compare briefly the percolation 
criteria of the evolving case with two static maximum entropy situations. 
If T^{z) = T{z) is the in-degree and T^{z) the out-degree distribution of 
the static model, we read from |||, [l^ that the condition for percolation is 



T"(1)T^(1) - (T'(l) - T'(l)2)2 > When only the in-degree is fixed in 
the static case, T+ is Poissonian, one finds 7 + 0; > 1. When moreover the 
out-degree is fixed to be independent of the in-degree and follow a geomet- 
ric law, so that the static graph has the same (global) degree distribution 
as the evolving graph, one finds 27 + 0;^ > 1. The thresholds are different 
in the static and evolving cases. Moreover, the percolating region of the 
evolving case always contains strictly the static percolating region, because 
when 7 < 1/4 and a < 1/2 both 7 + a and 27 + are less than 3/4 : the 
inhomogeneities of the evolving graph, in which old vertices have an effective 
high coordinacy, favor the emergence of a giant component. However, this 
giant component starts with a very tiny size, in contrast with the static case, 
when its growth is generically linear. 

6.4 Comments on the profile of the infinite cluster in 
the percolating phase. 

For a Poissonian in-degree distribution, the authors of obtained a closed 
equation which fitted perfectly with numerical simulations. The naive adap- 
tation of their argument to the general case is straightforward, but gives an 
incorrect result. 

Though we have not been able to derive a closed equation for the profile 
of the infinite component in the case of an arbitrary degree distribution, the 
successive derivatives of this profile at a = 1 can be computed in a systematic 
way as follows. 

Defining D = zd^C, we can derive from the previous results that 

crda-p = (a — zT' (D)) zd^p 
zT{D)-D = {a- zT\D))zd,D 
p{a=l,z) = zT{D). 



The first equation is just a rewriting of eq. (|l3|) , and the second one was 



obtained by applying the operator zdz to eq. ([121) .The last equation gives the 



density at a = 1. Then the first equation can be used to get the derivative 

^This is, as should be the case for a static graph, symmetric in T and because 
r'(l) = is automatic : any edge is in at one end and out at the other. 
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of the density at cr = 1 



ad^p{a = l,z) = {a - zT' (D)) zd,p{a = 1, z) 

= {a-zT'{D))zd,{zT{D)) 

= {a- zT' (D)) {zT{D) + zT'{D)zd^D) 

= {a- zT (D)) zT{D) + zT'{D){zT{D) - D) 

= azT{D) - zDT'{D). 

If D{z = 1) = 1, p{a = 1,2 = 1) = 1, but ad„p{a = 1,^ = 1) = 0. 
The same kind of algebra can be used to compute the successive derivatives 
{{ada)"'p) (cr = 1, z) for n = 2, ■ ■ -. Again, one can check that this vanishes if 
D{z = 1) = 1. This is of course natural in the non percolating phase. 
However, in the percolating phase, D{z = 1) = 1 — Pqo and one finds 

p(a = 1,2 = 1) = T(l-P„o) 
ad^pia = l,z = l) = aT(l-Poo) - (l-Poo)T'(l-Poo). 

Formally, we can write 

p{a, z) = eiog-[("-^'(^))-9^+(-^(^)-^)9i.]^T(D), (27) 

where now z and D are independant variables, which is true order by order 
in an expansion in powers of logcr. Writing Poo{cr) = 1 — p{cr, z = 1) for the 
profile of the giant component, eq.(|27D can be used to obtain systematically, 
for small values of n, a (rather complicated) formula for {{<jd^)^poo) {cr = 1) 
as a polynomial in Poo, T(l - Poo), T'(l - Poo) ■ ■ ■ , T(")(1 - Poo). 
In particular, trivial example, 

Poo(^= 1) = l-T(l-Poo) ^aPoo. (28) 

On the other hand, for a close to 0, 

I-Poo(ff) oca", (29) 

so the giant component contains all the old vertices, but only a fraction 
(which is exponentially small close the threshold) of the young vertices. That 
means the percolation transition is very inhomogeneous and takes mainly 
place in the part of the graph where it is denser than average. 

7 Conclusions. 

In this study, we have solved a model of evolving random graph which, albeit 
simple, involves an arbitrary in-degree distribution. 



25 



We have described the degree distributions and their local-in-time pro- 
files. By construction, in-degree and out-degree at each vertex are indepen- 
dent. The local out-degree distribution follows an age dependant Poisson 
law, which after integration over ages leads to a geometric global out-degree 
distribution. 

We have also made a detailed analysis of the distribution of component 
sizes, again at the global and the local-in-time level. We have shown the 
validity of the self averaging hypothesis by proving the equivalence with a 
direct exact tree enumeration. 

The parameter controlling the percolation transition has been found, 
quite surprisingly, to be simply the variance of the in-degree distribution, 
in contrast with the analogous static models. 

Below the transition, the large components have a size which scales like 
a power of the total size of the graph. The size of the giant component 
close to the threshold has been computed. It is exponentially small. The 
unusual fiuctuation induced percolation mechanism might be the reason why 
the critical behavior of this models is so different from what is observed in 
the case of static graphs (see |^ |ll], |13|). There, the generic behavior close 
to the transition is a linearly growing giant component. 

We have shown how all these differences could be used to discriminate in 
certain cases between a static and an evolving random graph even when the 
in and out degree distributions are the same for both. 

Among the unanswered questions is a direct description of the profile 
of the giant component. For a Poissonian in-degree distribution, the au- 
thors of 1^ obtained a closed equation which fitted perfectly with numerical 
simulations. The naive adaptation of their argument to the general case is 
straightforward, but gives an incorrect result. It would be desirable to find 
a valid argument for general T, or more modestly to understand why the 
argument in fact works for the Poissonian case. 

A A combinatorial identity. 

We show that the tree distribution, eqs.(^,|l^), leads to the generating func- 
tion formula, eq.([T2|). 

If is a finite (nonempty) set, we denote by 7y the set of trees with vertex 
set V. 

If moreover V is totally ordered , we denote its supremum by sy. If 
moreover \V\ > 2, we define Ty for n = 1, 2, ■ ■ ■ , | V"| — 1 as the set of trees 
with vertex set V such that vertex sy has n neighbors. Note that Ty is non 
empty. 
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IfV— [1, • • • , A;] (with the usual order) we denote 7y by 7^ and if A; > 2 we 
denote Ty by TfP' forn — 1, - ■ ■ ,k — l. We shall sometimes use the shorthand 
notation '4i,...,fc„ = Tk-^ x ■ ■ ■ x Tk^. 

If V is a totally ordered finite (nonempty) set, there is a unique order 
preserving map from V to [1, • • • , so that there is a canonical bijection 
between Ty and T\v\. For the same reason, if moreover |y| > 2 there is a 
canonical bijection between Ty and T^y^ for any n — 1,2, ■ ■ ■ ,\V\ — 1. 

Let V he a totally ordered finite (nonempty) set V with > 2 and let 
ne[l,---,\V\- 1]. Take a tree T e 7^. 

To T we associate the following data 

[i] An n-tuple of positive integers {ki, - ■ ■ , kn) such that 

kl^ Vkn^\V\-l. 

[ii] A sequence (Ti, ■ ■ ■ , T„) e Tfc^,...,fc„- 

[Hi] A sequence (Vi, • • • , Vn) of disjoint subsets of V , each endowed with the 
order induced from that of V , such that 

M {\Vil---AVn\) = {ki,---,k^), 
[h] \/iU---uK = n{M, 

[c] < • • • < sy„ for the order in V . 
[iv\ A sequence {vi, ■ ■ ■ ,Vn) & [I, ■ ■ ■ ,ki\ x ■ ■ ■ x [I, ■ ■ ■ ,kn\, 
as follows : 

Remove from T the vertex sy and the edges incident to it. What remains 
is a forest made of n components. There is a single way to label the cor- 
responding n vertex sets Vi, - ■ • ,Vn so as to satisfy [c], and then we define 
{ki, ■ • ■ , kn) by [a] so we have obtained [i] and [in]. For I e [1, • • • , n] the 
connected component with vertex set is a tree. We define T/ as its canon- 
ical representative in 7^, and by vi the vertex of T; whose preimage in Vi is 
connected to sy in T; this gives [ii] and [iv]. 

Conversely, one can recover T from the data by reversing the procedure. 

We let Pfc^ ... be the set of sequences {Vi, • ■ ■ , Vn) of disjoint subsets of 

V, each endowed with the order induced from that of V, satisfying conditions 

[a] and [b] above. The set obtained when moreover [c] is taken into account 

is denoted by Vl^'^^'^Y . There is a 1 to n! corespondence between Vl'^^^'^Y and 
-pv 

To summarize what we have found, we have put Ty in 
- 1 to 1 correspondence with the disjoint union 

UTfc,,..,.„ X VirXl X[l,---,k,]x..-x[l,---,kn], 
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- 1 to n! correspondence with the disjoint union 

X Vl^...^j,^ X [1, • • • , /Ci] X • • • X [1, • • • , 

where in both cases the union is taken over n-tuples of positive integers 
(/ci, • • • , k„) such that ki -\- • ■ ■ -\- kn — \ V\ — 1. 



From now on, we fix ^ = [1, • • • , /c] for some integer k>2. We assign to 
each tree T e 7^ a weight as follows. Write mj for be the number of edges 
of T connecting i to [1, • • • , i — 1]. Then 



/ da I ■ ■ ■ da^cr'^' 

JO<(Tl<-<(Tfc<l 



Wt = r™„ ■■■TmJ da,--- da^a^-^' - - - ■ 



Write Tir for the first factor and Xx for the integral, so that Wt = Hr^r- 
We change variables and set Ufc = cffc and Uj = aiak foi i — 1, - - - ,k — 1. The 
power of ak in the new integrand is A; — 1 + A;ci; — mi — • • • — = ka so 
integration over 5"^ leads to Xr — Y^ka with 



a-ruk-i 



Xt= / da,---dak-iat"''---alZl 

Jo<ai<-<ak-i<l 

To avoid ambiguities when several trees are used at the same time, we shall 
sometimes write mj instead of mj. 

Suppose that T e T^, which amounts to set n — ruk- We want to 
express Wx in terms of its decomposition, in fact in term of the n trees 
(Ti, • • • , T„) G Tfc^ X ■ • ■ X 7fc^ with ki + - - - + kn = \V\ — 1, and of the 
partition Vi U • • • U (it turns out that Wx does not depend on the choice 
of one vertex in each V^). 

The decomposition procedure associates to each i e [1, • • • , A; — 1] one of 
the trees Ti, • • • , T„, say T/ and a vertex i' e [1, • • • , h] in T^. By construc- 
tion of the decomposition, if j G [1, - - - ,i — 1] is such that is an edge of 
T then i and j have the same T^, f G [1, - ■ ■ ,i' — 1] and {i' is an edge of 
T;. Hence mJ — mj,^ and Tir = TnHri ■ ■ ■'^t„ has a simple multiplicative 
behavior. 

Our aim is now to show that when (Ti, • • • , T„) (and then automatically 
{ki,---,kn)) are fixed ir = Xx^ • • -Xx^. 

We introduce another tree. T, whose decomposition is made of the same 
trees (Ti, ■ ■ ■ , T„) as T, but with Vi = [1, - - - , ki],V2 = [ki + 1, - - - , /C1 + /C2], • • • 
and Vi = ki,V2 = A'l + /o^, ■ • •. Write rhj for the number of edges of T 
connecting i to [1, ■ ■ • , i — 1]. 
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There is a unique permutation, say A, of [1, ■ ■ ■ , A; — 1] which maps Vi into 
Vi, V2 into V2, ■ ■ ■, and is strictly increasing in each. Then rrii = rhx(i) for 
each i E [1, ■ ■ ■ , k — 1]. If we set = (TA(j) we obtain 

It = dai ■ ■ ■ dak-icx^ ■■■(Tk-i 

Write R\ for the region of integration < o-\(i) < ■ ■ • < (^\{k-i) < 1- 

Conversely, (fci, ■ ■ ■ , fc„) being kept fixed, if (Vi, ■ ■ ■ , Vn) describes 'Pij'...'^^ 
each permutation A of [1, ■ ■ ■ , — 1] such that A~^ is strictly increasing when 
restricted to Vi, ■ ■ ■ , appears exactly once. If R be the union of all such 
i?A's (the intersection of different R\s is of measure 0), one checks that 
((Ti, ■ ■ ■ , (Tfc_i) is in R if and only if < cxi < ■ ■ ■ < cxfc^ < 1, < ak^+i < 

This shows that when (Ti, ■ ■ ■ , T„) are fixed ^^[i, .fc] = ■ ■ '^r„- 

If we recall moreover that Hr = TnHri ■ ■ ■ 'H^^ and that Ty is in 1 to n\ 
correspondence with the disjoint union 

U(fci,.,fe„)rfc,,.,fc„ X X [1, ■ ■ ■ , /ci] X • ■ ■ X [1, ■ ■ ■ , A;„], 

we obtain as an immediate consequence that 

(1 + to) 5^ wt = 5^ ^^i^^i ■ ■ ■ 

If we define W'^^^ = EreT, = En Etgt;" ^r, it is plain that Efc W^*^^-^^ 
satisfies eq.(|l^), which has only one formal power series solution vanishing 
at 2 = 0. QED. 
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